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1. Introduction 

In this paper we study the asymptotic properties of a class of unbiased location 
invariant Hill-type heavy tailed index estimators. Let {X n ,n > 1} be an inde- 
pendent and identically distributed (i.i.d) sequence with distribution function 
(d.f) F(x) belonging to the domain of attraction of an extreme value distribu- 
tion, 



as n — > oo for all x £ K. We write F £ -D(G 7 ) if (1.1) holds and call 7 the tail 
index. 

The applications of the heavy tailed distributions may be found in many 
fields such as insurance, finance, climatology and environmental science (cf. 
Embrechets et al. (1997)), and the problem of estimating the heavy tailed index 
has been studied extensively. For positive 7, Hill (1975) introduced the well 
known Hill estimator, which is 



distributions 



G 7 (x) = cxp { -(1 + ~ix)- lh 1 , 7 G 1 and 1 +72: > 0, 



i.e. there exist normalizing constants a n > and b n £ R such that 
F n {a n x + b n ) — > G y (x) 



(1.1) 
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where X^ n < X2. n < ■■ • < -X"n,n are the order statistics of X\,X2, • • • ,X n . 
As an extension of Hill-type estimator for general 7 € M, Dekkers et al. (1989) 
proposed the following moment estimator 

where 



fc-i 

M^(fc) = ^ ^ (In - lnX„_ fc>n ) j ,j = l,2. 

i=0 

In applications both Hill and moment estimators are sensitive to the thresh- 
old, say X„_fc jn , and these two estimators are not invariant to the affine transfor- 
mation. Fraga Alves (2001) established the scale and location invariant Hill-type 
estimator given by 

H KQ 1 y 

/ 7 7 \ \ T 1 -A-n — i,n ^n— k,n 
In (k ,k) = -j— ln Y ~~Y _ ' 

*A-n— ko,n -™-n— k,n 

where the intermediate sequences fco and k satisfy 

k = k n = o(n), fco = o(k n ), k n — ► 00, fco — » 00, as n — * 00. (1-2) 
The asymptotic properties of 7n(&0j k) have been considered when 
U(tx)/U(t)-x-f ^x p -l 

— ^ X 1 

a(t) p 

holds as i — > 00, where C/(t) = i ?M- (l — l/t), t > 1, where F*~(x) denotes the 
inverse function of F(x). Based on the methods provided in Fraga Alves (2001) 
and Dekkers et al. (1989), Ling et al. (2007a, 2007b) proposed a kind of loca- 
tion invariant moment-type tail index estimator and considered its asymptotic 
properties under some second order regular varying conditions. 

Meanwhile the problem of searching for the unbiased estimators has been con- 
sidered by many authors, see, e.g., Peng (1998), Beirlant et al. (2002), Caeiro 
and Gomes (2002), Gomes and Martins (2002, 2004), Peng and Qi (2006a, 
2006b), Gomes et al. (2008), Gomes and Henriques (2008), and Qi (2008a, 2008b). 
Here we are interested in the class of semi-parametric heavy tail estimator in- 
troduced by Caeiro and Gomes (2002): 

where T(-) is the gamma function and 

fc-i 

k 



M n\k) ( mX n-i,« - lnX„_ fc ,„) Q , a > 0. 



i=i 
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They considered the asymptotic distributional representation of Mn a \k) and 
the choice of tuning parameter a such that 7„ (fc) is asymptotically normal 
with asymptotic null bias under the assumptions that 

hxU(tx) -]nU(t) -7 In a; x p - 1 

W) x> 

and y/kA(n/k) — > A, finite. 

Motivated by the works of Fraga Alves (2001) and Caciro and Gomes (2002), 
we propose a new class of location invariant estimators for a heavy tailed dis- 
tribution based on the asymptotic distributional representation of the following 
statistic: 

(fco, k) = ^ (in " *T fc - T, a e R+. (1.3) 

z— 

The asymptotic distributional representation of Af„ (fco, fc) will be derived un- 
der the following second order regular variation condition: 

U(tx)-U(t) _ g7_i 



lim a(t) 2— = * 7jP (x) for all x > 0, (1.4) 



where 



7+P _ J 

if 7 + p ^ 0, 



.In a;, if 7 + p = 0. 

and p < is the second order parameter and \A(t)\ G RV P (cf. Corollary 2.3.5 
of de Haan and Ferreira (2006)). Here, / <E RVp means lim^oo f(tx)/f(t) = x@ 
for all x > 0. 

Based on the convergence M« (fco, fc) T(a + 1)7° as fco — * 00, fco = o(fc), a 
location invariant Hill-type estimator for the heavy tailed index may be defined 

by 



ln ( "°' fc) -^(fco,fc) 



Af,l 2a) (fc ,fc) 



r(2a+l) 



a>l, (1.5) 



which converges to 7 in probability. Finite sample simulation shows that the 
positive tuning parameter a may be chosen appropriately to improve the per- 
formance of tail index estimation in applications. 

The paper is organized as follows. In section 2, we provide the main results, 
i.e. the asymptotic distributional representations of (1.3) and (1.5), and the 
optimal choice of the sample fraction fco by mean squared error (MSE) for some 
special distributions. Related proofs are deferred to section 4. Simulation studies 
are performed in section 3. 
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2. The main results 

Throughout this paper, we assume that Xi,X2, ■ ■ ■ ,X n are i.i.d random vari- 
ables (r.v.s) with d.f F(x), and denote by X\^ n < X2, n < • • • < X n>n the order 
statistics of X±, ■ ■ ■ ,X n . For the heavy tail distribution, i.e. 7 > 0, we know 
that 

F G D(G 7 ) 1 - F e RV- lh U G RV 1 . (2.1) 

We need the following notations to simplify the statements of our main re- 
sults: 

(Ja = v / r(2a+i)-r 2 (a + i), 



,_r(a) l-(l-p)° 

_ 1 f r(4a) 4r(2a - 1) 2r(3a) 
Q 4\ar 2 (2a) T 2 (a) a r(a)r(2a) ' 



and 



b a {l) = {l + l) l - a -\{l + l)- 2a ~\. (2.3) 

The first result is about the asymptotic distributional representation of 
M n a \k ,k). 

Theorem 2.1. Suppose that (2.1) holds for 7 > 0, and the intermediate ko 
and k satisfy (1.2). Then M n a \ko, k) converges in probability to T(a + 1)7". 
Furthermore, if the second order framework in (1.4) holds, we may obtain the 
following asymptotic distributional representation 

M^(k ,k) i 7 «r( a + l) + ^F( Q )+a7> Q (-7)(y) 7 (l + op(l)) 
+ B n , 

where P n a ^ is an asymptotically standard normal r.v., and 



-A[n/k) I — ) (l + o P (l)), j/ 7 + p ^0, 



7 + p \ k 



Based on Theorem 2.1, we may derive the asymptotic distributional repre- 
sentation of the proposed estimator j£*\ko, k) in (1.5), which is the following 
result. 
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Theorem 2.2. Under the conditions of Theorem 2.1, the following asymptotic 
distributional representation 



7l Q) (fco,fc) = 7 + 



n — n 



Rn 



op 



1 



(2.4) 



holds, where is asymptotically standard normal, and 



7 



7 + P 
7& Q (7)A(n/fc) 



( J 



hi 



(l + o F (l)), i/7 + p^O, 
(1 + op(1)), z/ 7 + P = 0. 



Furthermore, if there exist finite Ai and A2 suc/i i/iai 



Ai and ykoA(n/k) — > A2, 



^ ( 7n a) (fc , fc) - 7) - JV (Ai6 Q (7),7V Q ) 



where V a and 6,2(7) are defined in (2.2) and (2.3), respectively. Consequently, 
for every 7 > ; £/iere exists ao given by 



ota = a (7) 



In (l + 7 + v / ( 1 +7) 2 -l) 



ln(l + 7 ) 

such that £> Qo (7) = ; i.e. 7 n a °^(fco, £;) ^ as asymptotic null bias, even when 
Vfco(fco/fc) 7 -> Ai ^ and v/fe4(n/fc) -> A 2 7^ 0. 

For special A(i), we will consider the optimal choice of the sample fraction fco 
as a function of k, 7, p and a following the criterion of Fraga Alvcs (2001) to com- 



pute ko = ky PT such that the asymptotic MSE (Fraga Alves, 2001) of 7! (fco, fc) 
denoted by MSE 00 (j n a \ko 1 fc)) is minimal. Note that an asymptotic MSE is just 
the usual MSE based on some asymptotic relationship. For example, suppose 
9 n is an estimator of 9 based on a random sample of size n and that it satisfies 
y/(in~(6 n — 6 — b„) — > N(fi, a 2 ) as n — * 00 for some a„, b n , p and a 2 . Then the 
asymptotic MSE of 9 n denoted by MSE ao (9 n ) is simply {b n + pj y/a^} 2 + a 2 /a n . 

Theorem 2.3. Suppose that (1.4) holds with A(t) ~ ct p , p < and c ^ 0. Set 



"0 

(2) 


(3) 



2^(7) 



2^ + 1 2-y 



-2c 2 pb 2 a (-p) 
cryb a (-p) 



-2p 



n-^p* 1 ; 



• k~/+p ■ n~i+p , 
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where V a and 60(7) ore defined as before. Then we may obtain the sequence k^ 
that minimizes M S E^^yi (fco, k)), 

i. For 7 < — p, fcg pt = Kq ; 

ii. For 7 > —p, 

(a) . Ifk « „-P(27+l)/{7(-2p+l)} ; k opt = fc (l). 

(b) . J/fe > n-pV-r+V/M-zp+i)}, jfcgP* = ifcb a {-p)b a {~f) < and k° pt = 
k { 3) ifcb a (-p)b a (y) > 0; 

(C). Ifk ~ £ n -P(27 + D/{7(-2p+l)} ^ £, ^ Qj then k opt _ Din _ 2p/( _ 2p+ l) 

wi/i D\ = I?i(7, p, a, c, £)) satisfying 

aiD/ +a 2 D{ H +a 3 D 1 H ="fV a , 

where oi - 2 7 ^ ( 7 )£)-27 ; fl2 = [ 2c7 ( p _ 7 )/( 7 + p)]& a ( 7 )& a (-p)Z)-r and a 3 = 
-2p[ C 76 a (-p)/( 7 + p)] 2 . 

Corollary 2.1. Suppose (1.4) holds for A(t) ~ ct p twzift. p < 0, c 7^ 0, and i/ie 
intermediate sequence k satisfies k — ► 00, fc/n — > as n — > 00 /or 7 — p: 
or k -> 00, fc n -p(27+i)/{7(-2p+i)} ^ /or 7 > -p. Choosing k ~ fc^ = 

[ 7 F a 6- 2 ( 7 )/2] 1 /(27+l) . fc 2 7 /(2 7 + l) ; we ftaue 

V^F (¥ n a Hk { o\k) - 7 ) - JV (A i0q (7),7 2 ^) 
™t/i 60,(7) and V a defined as before. 



3. Simulation study 

Firstly, we present some 0(0(7) f° r heavy index 7 such that 0a (7) = 0. 
Table 1 shows that 00(7) is a decreasing function of 7. Firstly we consider the 
effect of the tuning parameter a on the heavy tail index estimator proposed in 
this paper. We randomly select a sample from Frechet d.f F(x) = cxp(— x^ 1 / 7 ) 
with 7 = 1. For 7 = 1, choose ao = 1.90 such that 6 Qo (7) = (cf. Table 1). Wc 
compare the sample paths of 7% , 7n and j n 3 ^ with sample size n = 3000. 
Figure 1 shows that 7n 1 ' 90 '' has a much smaller bias than others. 

For the optimal sequence kg, we denote ko := argminfc MSE(j^ (ho, k)) and 
fco '■= argmin/cp M SE(^j n (k$, k)). Consequently, we get the following possible 
measure of asymptotic relative efficiency (AREFF) of the proposed location 
invariant estimator 7^ = j„ (ko(ot),k) to the Hill-type location invariant 
estimator 7^ = 7^ (fc , fc) (bias-corrected). 

AREFF^ ~ S] = y a -7/(27+D . [ 7 /{(i + 7 ) |6 Q ( 7 )|}] 1/(27+1) (3.1) 
whenever 60,(7) 7^ 0. 



Table 1 

00(7) a function of 7, i.e. 0:0(7) = { a '■ bai'f) = 0}. 



7 





0.1 


0.5 


0.75 


1 


1.5 


2 


2.5 


3 


4 


00 


ao (7) 


00 


4.65 


2.37 


2.07 


1.9 


1.71 


1.60 


1.54 


1.49 


1.42 


1 
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Fig 1. Sample paths ofjn\ko,k), 7l 1 ' 90 ^ (fco, fc) and 7^ 3 '(fco,fc) for Frechet (1) distribution 
with sample size n = 3000. 
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Fig 2. Asymptotic efficiency of relative to 7^ /or 7 = 0.5, 1 and 2. 



Figure 2 shows the AREFF^ a ) for 7 = 0.5, 1 and 2. Simulation shows 
that for every 7, we may find some a on the left region of «o such that the 
AREFF in (3.1) is greater than one. 

Next we compare the relative efficiency of the proposed location invariant 
Hill-type estimator and that of Fraga Alves (2001) in terms of average mean 
and MSE for finite sample size. We consider the following two models: 

- Burr (a, j3) distribution with d.f F(x) = 1 — (1 + x a ) 13 , where x > 0, a > 
and (3 > 0. 

- Pareto (7) distribution with d.f F(x) = 1 — x~ 1 ^ 1 , where x > and 7 > 0. 
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■T>0' k > 

. ,,( 2 - 37 ) 



(k ,k) 



Fig 3. Mean value and MSE ofy^(k ,k), ~ti ao) (k ,k) andj£'(k ,k) for Burr (2,1) model 
with 7 = 0.5 and sample size n = 1500. 
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Pareto(2) 
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Fig 4. Mean value and MSE 0/7^(^0, k), 'y^°\ko,k) and 7£ 1 ^(fco,fc) for Pareto model with 
7 = 2 and sample size n = 1500. 



Figures 3 and 4 show the simulated mean value and MSE of 7jf(fc ,fc)> 
ln\ko,k) and 7r» (&o,&) for the distributions just mentioned. For both dis- 
tributions, "f n a °\ko, k) has a small bias and is closer to the real extreme value 
index value (see Figures 3 and 4). 
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Table 2 

Coverage probability comparison for 95% asymptotic confidence intervals based on (3.2) 

and (3.3). 





7 


?(*0.*) 


7 




7 


?(*o,fc) 


7n - ( fc 0,fc) 




7 


r^o,*) 


n 


Burr 


Burr (2,1) 


Frechet 


Frechet (1) 


Pareto 


Pareto (2) 




(2,1) 






(1) 


(2) 






200 





7020 





9860 





6860 


0.9920 


0.9370 





9700 


300 





6930 


U 


ncen 

yoou 





7000 


0.9810 


0.9310 


U 


yoou 


400 





7030 





9820 





6870 


0.9880 


0.9140 





9650 


500 





7165 





9730 





7060 


0.9800 


0.9250 





9620 


600 





7085 





9780 





7320 


0.9800 


0.9060 





9430 


700 





7100 





9770 





7350 


0.9690 


0.8980 





9490 


800 





7130 





9820 





7440 


0.9750 


0.9260 





9570 


900 





7115 





9740 





6970 


0.9610 


0.9020 





9380 


1000 





7220 





9745 





7320 


0.9770 


0.9120 





9400 


1100 





7245 





9760 





7170 


0.9580 


0.9050 





9370 


1200 





7295 





9700 





7050 


0.9640 


0.8910 





9150 


1300 





7255 





9665 





7200 


0.9620 


0.8940 





9280 


1400 





7230 





9615 





7400 


0.9730 


0.9020 





9320 


1500 





7400 





9640 





7070 


0.9620 


0.8970 





9310 


1600 





7180 





9610 





7290 


0.9700 


0.8730 





9150 


1700 





7455 





9665 





7270 


0.9630 


0.8760 





9280 


1800 





7310 





9615 





7330 


0.9760 


0.8930 





9110 


1900 





7160 





9605 





7350 


0.9610 


0.8940 





9110 


2000 





7325 





9620 





7250 


0.9600 


0.8900 





9080 



Lastly we compare the coverage probability and the confidence length based 
on the asymptotic normality of the two location invariant Hill estimators. Note 
that Corollary 2.2 of Fraga Alves (2001) provides us the 1 — 6 asymptotic con- 
fidence interval of 7, i.e. 

[ 1 + Zg/2/Vko 1 ~ Zg/2/Vk J 

We can also construct a new asymptotic confidence interval of 7 based on Corol- 
lary 2.1 by choosing ao such that 6 Q „(7) = for given 7. The asymptotic con- 
fidence interval of 7 is defined by 

w -»-l a-'c-ff , ^°w> }, (3.3, 

1 1 + (v aa /k Q ) 1/2 z e/2 1 - (V ao /k ) 1/2 z e/2 J 

where fcrj m (3.3) is the same as the one in (3.2) and zg/ 2 is the critical value 
of the standard normal distribution at level 8/2; that is, 1 — $(z e / 2 ) = 8/2. 
We drew 2000 random samples from the Frechet (1), Burr (2,1) and Pareto (2) 
distributions. The simulation was repeated 2000 times, we computed the cover- 
age probabilities of 7^(0.95) and the interval lengths for n = 200, 300, • • • , 2000. 
These coverage probabilities and interval lengths are reported in Tables 2 and 3. 

We may conclude that the two estimators have comparable coverage probabil- 
ities only for Pareto distribution. Generally, the proposed estimator 7„ (fcp, k) 
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Table 3 

Average length comparison for 95% confidence levels based on (3.2) and (3.3). 







7i J (fc ,fc) 


7« (fco,*0 


% >(k ,k) 






n 


Burr 


Burr (2,1) 


Frechet 


Frechet (1) 


Pareto 


Pareto (2) 




(2,1) 


(1) 


(2) 


200 


0.6680 


6.4595 


0.6672 


6.4281 


1.1188 


3.4414 


300 


0.5360 


2.9143 


0.5378 


2.9436 


0.9174 


2.5042 


400 


0.4670 


2.1501 


0.4681 


2.1292 


0.8098 


2.1103 


500 


0.4206 


1.7727 


0.4226 


1.7903 


0.7366 


1.8655 


600 


0.3895 


1.5558 


0.3879 


1.5590 


0.6826 


1.7042 


700 


0.3643 


1.4038 


0.3637 


1.4130 


0.6461 


1.5912 


800 


0.3442 


1.2775 


0.3423 


1.2887 


0.6120 


1.4909 


900 


0.3291 


1.2146 


0.3305 


1.2292 


0.5856 


1.4322 


1000 


0.3148 


1.1447 


0.3138 


1.1255 


0.5616 


1.3519 


1100 


0.3034 


1.0865 


0.3030 


1.1009 


0.5484 


1.3237 


1200 


0.2928 


1.0485 


0.2948 


1.0565 


0.5282 


1.2601 


1300 


0.2840 


1.0014 


0.2839 


0.9973 


0.5134 


1.2306 


1400 


0.2754 


0.9734 


0.2762 


0.9698 


0.5013 


1.1996 


1500 


0.2685 


0.9418 


0.2693 


0.9373 


0.4871 


1.1606 


1600 


0.2625 


0.9182 


0.2618 


0.9084 


0.4755 


1.1283 


1700 


0.2551 


0.8808 


0.2568 


0.8926 


0.4648 


1.1024 


1800 


0.2505 


0.8671 


0.2502 


0.8478 


0.4572 


1.0817 


1900 


0.2460 


0.8453 


0.2452 


0.8363 


0.4482 


1.0618 


2000 


0.2407 


0.8237 


0.2409 


0.8234 


0.4395 


1.0421 



has better coverage probability than that of 7^(fc , k). But it has a wider con- 
fidence interval length than that of 7^(fco> k) especially when the sample size n 
is small. Meanwhile, Figure 2 tells us that there exists «o such that jn (ko, k) 
has asymptotic null bias and better asymptotic efficiency than 7^ (ko, k). This 
new location invariant Hill type estimator may be useful in empirical analysis. 



4. Proofs 



Before proving the main results, we need some lemmas. 

Lemma 4.1. Suppose that (1-4) holds for 7 > and p < 0, then for any e, 
8 > 0, there exists to = to(e, 6) such that for all t > to and x > 1, 



In 



U(tx)-Ujt) 
a{t) 



In D 7 (x) 



A(t) 



< e I + 7 



"i+p+S 



where D 1 (x) = (x 1 — l)/7 and 



B iA x ) 



7 x^ +p - 1 
7 + p af — 1 

7 



a;7 - 1 



lnx, 



ifj + p^O, 
if 7 + P = 0- 
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Proof. We only consider the case of 7 + p ^ 0. Note that by (1.4) we have 



U{tx)-U{t) 



1 



7 x 



7+P _ I 



This implies 



lim 

t — >oo 



7 + p a;T - 1 



a(t) 



B 7 , p (x) 



(4.1) 



since ln(l + — > 0. By Theorem 2.3.6 in de Haan and Ferreira (2006), 
for any e, 6 > 0, there exists to = £o(£j 8) such that for all t > to and x > 1, 



U(tx)-U(t) _ n / x 
a(t) u lW 



A(t) 



< ex 



7+P+5 



Hence, 



lnS^-hi? 7 (x) 



A(t) 



, _ / U(tx)-U(t) _ , 

111 a{t)D^(x) \ a(t)D-,(x) 



D 7 (x) 



A(t) 

x)-b 
a(t) 



Alt) 



< e 1 + 7 



7+P+5 



XT - 1 



The proof is complete. 

Lemma 4.2. Under the condition of Lemma 4-1, for x > 1, y > \ we have 



□ 



, U(tx)-U(t) _ , ^-1 
Um m U{ty)-U{t) 111 

t — >oo 



A(t) 



F-r, P (x,y). 



Moreover, for any e, 6 > 0, there exists to = to(e, 6) suc/i i/iai for all t > to and 
x > y > 1, 



, U(tx)-U(t) _ , g-,(x) 
m U(ty)-U(t) 111 D^fo) 



4(f) 



F i, P (x,y) 



where 



x i+p _ 1 y 7+P _ 1 
7 + /) \ i^-l y 7 -l 
lnx : my, 



xi - 1 



if 7 + p ^ 0, 
*/ 7 + P = 



and T 7iP (x, y) = e(2 + 7 {^+"+ 5 + yT+P+«}/{y7 _ 1}). 



J. Li et al. /Unbiased location invariant Hill-type estimators 



8 10 



Proof. We only consider the case of 7 + p ^ 0, similar arguments for 7 + p = 0. 
Note that (4.1) implies 



lim 

t — >oc 



, U{tx)-U(t) 
111 U(ty)-U(t) 



hi 



x'<-l 
y-l-X 



7 



A(t) 



X l+P _ 1 y 7+P _ 1 



7 + p V a; 7 - 1 



for x > 1, y > 1. For any e, 8 > 0, there exists to = to(e,S) such that for all 
t > to and x > y > 1, 



, U(tx)-U(t) 



In J ^ r > 

m 5^5 



A(t) 



F ~t, P (x,y) 



< 



111 



A(t) 

»pi-lni3 7 (x) 



A(t) 



+ 



in 



A(t) 

U(ty)-U(t) 
o(i) 



B 7 ,p(x) 



ln£> 7 (y) 



A(f) 



B iAv) 



□ 



By using Lemma 4.1, we get the desired result. 

Proof of Theorem 2.1. For the asymptotic distributional representation of 
Mn a \k , k), we only consider the case of 7 + p 7^ 0. By Lemma 4.2, we have 

[/(te) - U(t) 



In 

= In 



x 1 — 1 7 



J/ 7 - 1 7 + P 



= 7 In h In 1 

2/ 



X 7+P _ 1 y 7+P _ 1 

a; 7 - 1 y 7 - 1 

7 - a;" 7 



^(*) (l+o(l)) 



i-ir 



7 



7 + p 



yP -y ~< 



1 - X-T 



i-r 



A(t) (l + o(l)). 



Let Yi, I2, • • • , be i.i.d Pareto r.v.s with Fy(y) = l—l/y,y> 1 and let Yi jn < 
^2,n < • • ■ < ^n,n denote the order statistics of Yi, Y2, • • • , Y„. Now replace t 
by Y„_ fc ,n, X by Y n -i, n /Y n -k,n and y by Y n -h ,n/Yn-k,n, respectively, and note 
that {X^ =l = {U(Yi)}™ =l , (Y n _ hn I Y„_ fc , n )" 7 < (Y„_ feoi „ / Y n _ k , n )-~< -» 
in probability uniformly for i = 0, 1, ■ • ■ , ko — 1, and {Y n _j j „/Y n _fc 0jn }k.2.Y 1 = 

{Yco-^olfe 1 - SO, 
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L n — i.n yi -n — k,n 



d ln U(Y n -j, n ) - U(Y n - k , n ) 

n t/(y n _ fco ,„) - u(Y„_ Kn ) 



^ 111 



Y 



n—ko ,n 



7 



7 + P ^ 

Y i ^ p 

1 n — KQ ,1 



l - 



J n—i,n 



(i + op(i)) 



7 m *fco-i,fco + 

7 



1 - 



Yn — k(),n 



Y n — ko,n 
P 



A(Y n - k , n ) {1 + P (1)) 



1-Y 



k — i,k 



~] (! + <*(!)) 



7 + P 



1 - Yf 

k -i,k 



By Taylor's expansion, we may get 
M« (k ,k) 

a 



7 A 


[ko 


-i,k J 


u 




-p " 


it 





A(n/k) (l + o P (l)). 



1 fc ° _1 f 



+ a 7 Q (lnn _ 4 , fc0 ) 
07 



l - y~ 7 . 

a — 1 fco — "ij&o 



fc 



;(lnn - 



7 + P 
-(l-Y ka - lM p ) 

+ A 2 (n/k)0 P ( 



1-Y 



hQ—i,ko 



(1 + op(1)) 



k 

2-, 



A(n/k)(l + o P (l)) 

2 P / I, \ 



Note that £[(ln Yi) Q ] = r(a + 1), l/ar[(ln Yi) Q ] = er Q and 



E 



{lnY 1 ) a - 1 (Y 1 ^'-l)/p =fx a {p). 



So, 



P w = ^S£iO°yQ a -r(a + i) 



iV(0,l). 
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By (4.2) and the law of large numbers, the asymptotic distributional represen- 
tation of Mn a \ko, k) is 



7 + P 

which is the desired result. 



/j, a (p)A(n/k) 



k 



(1 + op(1)), 



(4.3) 
□ 



Proof of Theorem 2.2. Note that both {Xj}f =1 = {J7(^-)}?=i and {^n-i.n / 

^«-fc ,n}ta 1 = { Y k -i,k }i=i 1 nold - For 7 + p ^ 0, by using the conditions of 
Theorem 2.2, following joint expansion in distribution, 



(fc ,fc) V /2 



T(a) 



(W n ,Z n ) 



(4.4) 



holds, where 



0~2a Pi 



(2a) 



M2a(-7) fko 



2r(2a+ 1) ^/T Q 2r(2a) V k 

(1 + op(1)) 



PP>2a{p) j\( n \ ( 



and 



,1-a 



2( 7 + p)r(2a) Vfc 

/ <7 Q _i P,T /io,_l(-7) /fc \ 7 



r(a) v/fco r(a-i) V fc 



pp a ~l{p) 



n\ ( kg 



(7 + p)r(a- 1) \kJ \k 

Note that (4.4) may be deduced from Wold device and Delta-method, as for 
arbitrary a,b € R, we have 

a (aI^ (fc , fc)/r(2a + 1)) ^ + bT(a) (k ,k)) ~* 



1 1 



fco-l 



t/(yn-i,n)-^(yn-fc,n) 
r(2a + 1) fc ^ V'" (^n-fco,n) - ^ (^n-fe,n) 

U(Y n ^ n )-U{Y n 



2a 1 



1/2 



m ) ( 1 Win u ^-^ 

\ k o hi \U(Y n _ k0 , n 



) - (Y„_k,„) 



,(2c 



1/2 



\^r(2a+l) 
aW n + bZ n . 



+ bT(a) (Q£-»)' 



1 



1 + (1 - ZnT - 1 ) 
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The last step follows since Z n 4 7 1 "" and h(l - Z n ^ a ~ x ) - h{0) = ft'(O) 
(1 + op(l)) (1 - Znl"" 1 ) with /i(x) = 1/(1 + where 



j fco-1 r 

r ^ ^ ( 7 iny feo _. t , fco ; 



+ /3 7 ^lny, _ 4 , fco f- 1 



fc -i,fc / ft 



+ — — (lnlfc _i,fen) 

7 + P 



(1 + op(1)) 



fc 



(1 - yfeo-i,fco P ) 



2- 



i4(n/fc)(l + op(l))| 

- 2 P / I.. \ T-P N 



for /3 > 1. So, 
ji a) (k ,k) = Z n W n 



i , 1 f g 2a „(2 ) _ p(g-l) 

\2T{2a + l) n r(a) " 

/ M2a(~7) Mo-l(-7) 



V 2r(2a) r(a - 1) J \ k 

P ( M2a(p) _ Ma-i(p) 
7 + p^2r(2a) r(a-l) 



.4 



(i) 



(1 + op(1)) 



Op 



Denote 

Q { n } 



g 2a p(2a) a a-l p(a-l) j,(a) _ <-rfn 

2r( 2a+ l) - Tea) « ' « -y^^- 



Let fk (t) denote the characteristic function of Q„ . Noting the expression of 
P^ a) in (4.2) and that of V a in (2.2), we have 



f ko (t) = Eaq>\it 



0"2c 



p(2a) _ j /a-l pfa-l) 

i""2r(2a + l) " r(a) n 



£exp| ft v an^) 2a -r(2a + l) 

Iv^ft 2r( 2a + i) 
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/fco 



k a 

~[Eexp 

3=1 
ko r ,2 



it f (In Y,-) 2 " - T(2a + 1) (lnY,-)"" 1 - T(a) 



2T(2a + 1) 



I» 



(\nY/f a -T{2a + 1) (In y^)"" 1 - r(a) ' 



2r(2a+ 1) 



r(a) 



1 



2fc 



fco 



exp(-^ Q /2). 



So, Qn Q ^ converges in distribution to a normal r.v. with null bias and variance V a 
and Tn"^ is an asymptotically standard normal r.v. Noting "f[n2a(— 7 ) /{2r(2a)}— 
Ma -i(-7)/r(a-l)] = 6 a ( 7 ) and { 7 p/( 7 +p)} [i"2«(p) / {2r(2a)}- McK _ 1 (p)/r(a- 
1)] = —{7/(7 + p)} b a (—p), we obtain the asymptotic distributional represen- 



tation of 7 i Q ^(fco> fc)- The remaining part of Theorem 2.2 is immediate. 



□ 



Proof of Theorem 2.3. Assume that A{t) ~ ct p with c / and p < 0, then 
according to the range of the pair (7, p), we need to consider the following two 
cases. 

(i). The 7 < — p case. Combining (2.4) and Theorem 2.2, we have R n = o((fco/fc) 7 ) 
and 



7< a) (fc o ,A0=7+^TM+M7) 



and 



MSE^^ (fc , *0) = 



7 X 



Op 



1 



/ fco 



op 



V 7 
fc 



So, the sequence of ko(n) that minimizes MSEooffin (fco, fc)) is 



2"; 



LOP* _ 7,(1) 



7^a 



2T+1 2-y 
• fc 2 T + l 



.2^( 7 )_ 

(ii). The 7 > — p case. Noting the expression of R n in (2.4), we have 
R n -c 7 6 Q (-p)/(7 + p) (fc /") _p (1 + o P (l)) and 



o P 



1 



Clbg{-P) ( fco 

7 + p \ n 
fco 



0P| 1 » 



(4.5) 



We need to investigate the related weights in (4.5). Consider any sequence fco 
satisfying fc = 0{k^/^ +p ^ ■ n^^+^), then (fc /fc) 7 = O((k /n)- p ). Moreover, 
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if k < fcT/(7+p). n P/(7+p) j thcn (fc /fc)T < (fc /n)- p ; if fc > fc7/(7+/>) . n p/(7+p) ; 

then (fco/fc) 7 3> (ka/n)~ p . In order to obtain the optimal choice of fco, we have 

to consider the relationship of k and n. 

(a). Firstly, suppose fc 2 7/(27+i) < n -2 P /(-2 P +i) ) then 



^<C n - 2 p+! j 



If the sequence fco satisfies fc = 0(fc 2 ^/( 2 ^ +1 )), then = O((fc /fc) 7 ). More- 

over, if fc < fc 2 ^/(^ +1 ), then l/ v / ^> (fco/fc) 7 , 1/VEb » (h/k)- p , and MS^ 
(t1 q) (fc , ft)) = 7 2 V Q /fc is a decreasing function of fc ; If fc > fc2 7 /(2 7 +i) ; 
then (fc /fc) 7 » (fco/fc) 7 » (V™)~ p and MSE X (j n a) (k , fc)) = b\ (7) 

(fco/fc) 27 is an increasing function of fco. So. we choose fco = 0(k 2 ' ) '/( 2 7+i)) 
in order to balance the bias and variance of the estimator 7^ (fco, ft), hence 
MSEntffrHko,*)) = j 2 V a /k + & 2 ( 7 )(fc /fc) 27 and 



h opt 



.(1) 



.26 2 (7) 

(6). If fc27/(27+l) > n -2p/(-2p+l) j wc hayc 



■ fc 2 7+l , 



7 P , 2 7 / -2p 

fc^+p • n-i+p ^> k 2 i+ 1 ^> n -2p+i 



In this situation, if k = 0(n~ 2 P/(- 2 P +1 1), then = O((k /n)- p ). Moreover, 

if fc < n -2p/(-2p+i) 5 t h cn > (fco/n)-/" > (fc /fc)T and MS^ (7^ 

(fc , ft)) = 7 2 V Q /fc ; if n -2p/(-2p+i) < fc < fc7/(7+P) . n p/(7+p) ; then (ko/n)- p 
» 1/Vfco", (ko/n)~P » (fc /fc) 7 and MSE^ (7^ (fc , fc)) = [c 7 fc a (-p) / (7+p)] 2 
(fc /n)~ 2p ; if fc > k^+d ■ n P^+P\ then (fc /fc) 7 > (k /n)-P > 

Choosing the sequence fco = 0{n^ 2p /^ 2p+1 ^) to balance the bias and variance 
is possible. Then 



l 2 V a 
fco 



(rfb a (-p) 



7 + P 



-2p 



We get the locally minimized MSE for the sequence 

(l + p) 2 V a 



h opt 



fc (2) - 



-2pc 2 b 2 a (-p) 
But if any sequence fco = 

0(fc7/(7+P) . n p/h+p)) and 



-2p 

n-2p+! . 



MSEooiffi (fc ,fc)) 



6a(7) 



C)b a (-p) / fco 

7 + p \n 



(4.6) 



we need to see the sign of cb a (^)b a (—p). Note that if cb a (j)b a (—p) < 0, (4.6) is 



an increasing function of fco, and fco Pt = k^ 1 is the solution to the optimization 



(2) 
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problem. On the other hand, suppose c6 a (7)6o,(— p) > 0, let k^ pt = k^ be such 
that M S E oc (j n a \ko, k)) — in (4.6). Then, the optimal sample fraction is 



1.(3) _ 
ft — 



(c). Lastly, we consider k ~ z) n -p(2i+i)/{i(-2 P +i)} with d ^ 0, i.e. ^7/(27+1) 
= 0(n- 2 P/(- 2 P + ^). We derive that fc7/(7+p) . n p/h+P) j s f the same order 
of either /c 2 t/( 2 t +1 ) or n~ 2p ^~ 2p+1 \ and l/V^o is of the same order of either 
(fco/fc) 7 or (ko/n)~ p . Hence, fcg pt must be the sequence such that 



ko 



attains its minimum, which enables us to identify kQ Pt ~ Z?i?i 2p /( 2 p+!) with 
Z?i = f 1(7, p, a, c, D) satisfying 



a\D 1 



27+1 



a 3J Dr 2 " +1 =7 2 ^, 



where 01 = 2 1 b 2 a { 1 )D~ 2 -< , a 2 = [2c 7 (p - 7)/ (7 + p)]6 Q ( 7 )6 Q (-p)Z?-T and a 3 = 
-2p[ C7 6 Q (-p)/( 7 + p)] 2 . □ 
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